How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling

Authors

  • Goshu Nagino
  • Makoto Shozakai
  • Kiyohiro Shikano
Abstract

Reusing existing speech corpora is becoming one of the most important issues in developing a target speech recognition system at lower cost in time and money. This paper proposes a new technique for judging the reusability of existing speech corpora for a target task using a statistical multidimensional scaling method. In an experiment using twelve tasks from five speech corpora, the proposed method showed high correlation with cross-task recognition performance and correctly judged the reusability of existing speech corpora for the target task at lower cost.
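
As a rough illustration of the general idea of placing a target task and existing corpora on a low-dimensional map, here is a minimal sketch using classical (Torgerson) multidimensional scaling. The abstract does not specify the dissimilarity measure or the exact scaling procedure, so this is a stand-in rather than the authors' method; the function `classical_mds`, the corpus names, and the distance values are all hypothetical.

```python
import numpy as np

def classical_mds(dist, n_components=2):
    """Embed points so Euclidean distances approximate `dist` (classical MDS)."""
    d = np.asarray(dist, dtype=float)
    n = d.shape[0]
    # Double-centre the squared distances to obtain a Gram matrix.
    j = np.eye(n) - np.ones((n, n)) / n
    b = -0.5 * j @ (d ** 2) @ j
    # Keep the largest eigenvalues; clip tiny negatives caused by rounding
    # or by non-Euclidean input.
    vals, vecs = np.linalg.eigh(b)
    order = np.argsort(vals)[::-1][:n_components]
    vals = np.clip(vals[order], 0.0, None)
    return vecs[:, order] * np.sqrt(vals)

# Hypothetical dissimilarities between a target task (index 0) and three
# existing corpora. These numbers are invented for illustration only.
names = ["target", "corpus_A", "corpus_B", "corpus_C"]
dist = np.array([
    [0.0, 1.0, 4.0, 5.0],
    [1.0, 0.0, 3.0, 4.0],
    [4.0, 3.0, 0.0, 1.0],
    [5.0, 4.0, 1.0, 0.0],
])

coords = classical_mds(dist)

# Rank existing corpora by their map distance to the target task:
# under this sketch, a closer corpus is assumed to be a better reuse candidate.
gaps = np.linalg.norm(coords[1:] - coords[0], axis=1)
ranking = [names[i + 1] for i in np.argsort(gaps)]
print(ranking)
```

In a real study, the map distances would then be compared against measured cross-task recognition performance, which is the correlation the abstract reports.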

Similar resources

Analyzing reusability of speech corpus based on statistical multidimensional scaling method

In order to develop a target speech recognition system with less cost of time and money, reusability of existing speech corpora is becoming one of the most important issues. This paper proposes a new technique of applying a statistical multidimensional scaling method to analyze the reusability of a speech corpus. In the experiment using six speech corpora, which contain isolated words and shor...

Multidimensional scaling of systems in the Voice Conversion Challenge 2016

This study investigates how listeners judge the similarity of voice converted voices using a talker discrimination task. The data used is from the Voice Conversion Challenge 2016. 17 participants from around the world took part in building voice converted voices from a shared data set of source and target speakers. This paper describes the evaluation of similarity for four of the source-target ...

The Development of the Multilingual LUNA Corpus for Spoken Language System Porting

The development of annotated corpora is a critical process in the development of speech applications for multiple target languages. While the technology to develop a monolingual speech application has reached satisfactory results (in terms of performance and effort), porting an existing application from a source language to a target language is still a very expensive task. In this paper we addr...

Using Multidimensional Scaling for Assessment Economic Development of Regions

Addressing socio-economic development issues is strategic and of the highest importance for any country. Multidimensional statistical analysis methods, including comprehensive index assessment, have been successfully used to address this challenge, but they don't cover all aspects of development, leaving some gap in the development of multidimensional metrics. The purpose of the study is to const...

MDS-based visualization method for multiple speech corpora

The purpose of this study is to visualize the similarities between speech corpora. Speech data are indispensable for promoting speech research. A wide variety of speech corpora has recently been developed in many countries. Corpus diversification has given users many choices for corpus selection. In order for users to easily utilize these various corpora, we propose a new feature visualization ...

Publication date: 2007